Bridge Surface Crack Detection Algorithm Based on YOLOv3 and Attention Mechanism
CAI Fenghuang1,2, ZHANG Yuexin1,2, HUANG Jie1,2
1.College of Electrical Engineering and Automation, Fuzhou University, Fuzhou 350108; 2.Key Laboratories of Industrial Automation Control Technology and Information Processing of Fujian Province, Fuzhou University, Fuzhou 350108
Abstract:To realize fast and accurate detection of bridge surface cracks for the timely repair, a bridge surface crack detection algorithm based on improved YOLOv3 (Crack-YOLO) is proposed. Crack-YOLO is combined with depthwise separable convolutions and attention mechanism to detect bridge surface cracks in real time. The standard convolution of YOLOv3 is replaced with the depthwise separable convolution to reduce the number of network parameters. Moreover, the inverted residual block of MobileNet V2 is introduced to solve the problem of precision decline caused by depthwise separable convolution. In Crack-YOLO, both channel attention and spatial attention of the image are taken into account through the convolution block attention module to learn the feature selectively. The experimental results show that Crack-YOLO detects the cracks on the surface of the bridge in real time. Compared with YOLOv3, Crack-YOLO produces smaller weights and higher detection accuracy at a higher detection speed.
[1] 《中国公路学报》编辑部.中国桥梁工程学术研究综述: 2014.中国公路学报, 2014, 27(5): 1-96. (Editorial Department of China Journal of Highway and Transport. Review on China's Bridge Engineering Research: 2014. China Journal of Highway and Transport, 2014, 27(5): 1-96.) [2] BALAKUMARAN S, WEYERS R E, BROWM M C. Influence of Cracks on Corrosion Initiation in Bridge Decks. ACI Materials Journal, 2017, 114(1): 161-170. [3] 黄军生. 钢筋混凝土桥梁裂缝成因综述.世界桥梁, 2002(2): 59-63. (HUANG J S. Summary of the Causes of Cracks in Reinforced Concrete Bridges. World Bridges, 2002(2): 59-63.) [4] ABDEL-QADER I, ABUDAYYEH O, KELLY M E. Analysis of Edge-Detection Techniques for Crack Identification in Bridges. Journal of Computing in Civil Engineering, 2003, 17(4): 255-263. [5] 阮小丽,王波,荆国强,等.桥梁混凝土结构表面裂缝自动识别技术研究.世界桥梁, 2017, 45(6): 55-59. (RUAN X L, WANG B, JING G Q, et al. Study of Automatic Identification Technology for Surface Cracks in Bridge Concrete Structures. World Bridges, 2017, 45(6): 55-59.) [6] ZALAMA E, GÓMEZ-GARCÍA-BERMEJO J, MEDINA R, et al. Road Crack Detection Using Visual Features Extracted by Gabor Filters. Computer-Aided Civil and Infrastructure Engineering, 2014, 29(5): 342-358. [7] KRIZHEVSKY A, SUTSKEVER I, HINTON G E. ImageNet Classification with Deep Convolutional Neural Networks // PEREIRA F, BURGES C J C, BOTTOU L, et al., eds. Advances in Neural Information Processing Systems 25. Cambridge, USA: The MIT Press, 2012: 1097-1105. [8] SHAN H, ZHANG J P. Randomized Distribution Feature for Image Classification // Proc of the 22nd European Conference on Artificial Intelligence. Berlin, Germany: Springer, 2016: 426-434. [9] 李良福,马卫飞,李丽,等.基于深度学习的桥梁裂缝检测算法研究.自动化学报, 2019, 45(9): 1727-1742. (LI L F, MA W F, LI L, et al. Research on Detection Algorithm for Bridge Cracks Based on Deep Learning. Acta Automatica Sinica, 2019, 45(9): 1727-1742.) [10] ZHANG L, YANG F, ZHANG Y D, et al. Road Crack Detection Using Deep Convolutional Neural Network // Proc of the IEEE International Conference on Image Processing. Washington, USA: IEEE, 2016: 3708-3712. [11] GIRSHICK R, DONAHUE J, DARRELL T, et al. Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation // Proc of the IEEE Conference on Computer Vision and Pa-ttern Recognition. Washington, USA: IEEE, 2014: 580-587. [12] GIRSHICK R. Fast R-CNN // Proc of the IEEE International Conference on Computer Vision. Washington, USA: IEEE, 2015: 1440-1448. [13] REN S Q, HE K M, GIRSHICK R, et al. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(6): 1137-1149. [14] REDMON J, DIVVALA S, GIRSHICK R, et al. You Only Look Once: Unified, Real-Time Object Detection // Proc of the IEEE Conference on Computer Vision and Pattern Recognition.Washington, USA: IEEE, 2016: 779-788. [15] REDMON J, FARHADI A. YOLO9000: Better, Faster, Stronger // Proc of the IEEE Conference on Computer Vision and Pattern Recognition. Washington, USA: IEEE, 2017: 6517-6525. [16] REDMON J, FARHADI A. YOLOv3: An Incremental Improvement[C/OL]. [2020-06-11].https://arxiv.org/pdf/1804.02767.pdf. [17] 李庆忠,李宜兵,牛炯.基于改进YOLO 和迁移学习的水下鱼类目标实时检测.模式识别与人工智能, 2019, 32(3): 193-203. (LI Q Z, LI Y B, NIU J. Real-Time Detection of Underwater Fish Based on Improved YOLO and Transfer Learning. Pattern Recognition and Artificial Intelligence, 2019, 32(3): 193-203.) [18] HOWARD A G, ZHU M L, CHEN B, et al. MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications[C/OL].[2020-06-11]. https://arxiv.org/pdf/1704.04861.pdf. [19] SANDLER M, HOWARD A, ZHU M L, et al. MobileNetV2: Inverted Residuals and Linear Bottlenecks // Proc of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Washington, USA: IEEE, 2018: 4510-4520. [20] WOO S, PARK J, LEE J Y, et al. CBAM: Convolutional Block Attention Module // Proc of the European Conference on Computer Vision. Berlin, Germany: Springer, 2018: 3-19. [21] HE K M, ZHANG X Y, REN S Q, et al. Deep Residual Learning for Image Recognition // Proc of the IEEE Conference on Computer Vision and Pattern Recognition. Washington, USA: IEEE, 2016: 770-778.